Efficient Nonlinear Control with Actor-Tutor Architecture
نویسنده
چکیده
A new reinforcement learning architecture for nonlinear control is proposed. A direct feedback controller, or the actor, is trained by a value-gradient based controller, or the tutor. This architecture enables both efficient use of the value function and simple computation for real-time implementation. Good performance was verified in multi-dimensional nonlinear control tasks using Gaussian softmax networks.
منابع مشابه
Ecient Nonlinear Control with Actor-tutor Architecture
A new reinforcement learning architecture for nonlinear control is proposed. A direct feedback controller, or the actor, is trained by a value-gradient based controller, or the tutor. This architecture enables both e cient use of the value function and simple computation for real-time implementation. Good performance was veri ed in multi-dimensional nonlinear control tasks using Gaussian softma...
متن کاملA Virtual Cooperative Learning Environment Using Human Companion
This paper describes the architecture of an Interactive Learning Environment (ILE) on the internet using companions which one is a human and geographically distant of the learning site. The achieved system rests on a three-tier customer/server architecture (customer, web server, data and applications server) where human and software actors can communicate via the internet and uses the DTL learn...
متن کاملA novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
An online adaptive reinforcement learning-based solution is developed for the infinite-horizon optimal control problem for continuous-time uncertain nonlinear systems. A novel actor–critic–identifier (ACI) is proposed to approximate the Hamilton–Jacobi–Bellman equation using three neural network (NN) structures—actor and critic NNs approximate the optimal control and the optimal value function,...
متن کاملApplying the Episodic Natural Actor-Critic Architecture to Motor Primitive Learning
In this paper, we investigate motor primitive learning with the Natural Actor-Critic approach. The Natural Actor-Critic consists out of actor updates which are achieved using natural stochastic policy gradients while the critic obtains the natural policy gradient by linear regression. We show that this architecture can be used to learn the “building blocks of movement generation”, called motor ...
متن کاملREINFORCEMENT LEARNING AND OPTIMAL CONTROL METHODS FOR UNCERTAIN NONLINEAR SYSTEMS By SHUBHENDU BHASIN A DISSERTATION PRESENTED TO THE GRADUATE SCHOOL OF THE UNIVERSITY OF FLORIDA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY
of Dissertation Presented to the Graduate School of the University of Florida in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy REINFORCEMENT LEARNING AND OPTIMAL CONTROL METHODS FOR UNCERTAIN NONLINEAR SYSTEMS By Shubhendu Bhasin August 2011 Chair: Warren E. Dixon Major: Mechanical Engineering Notions of optimal behavior expressed in natural systems led research...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996